An information theoretic approach for using word cluster information in natural language call routing
Abstract
In this paper, an information-theoretic approach for using word clusters in natural language call routing (NLCR) is proposed. This approach uses an automatic word class clustering algorithm to generate word classes from the word-based training corpus. Information gain (IG) based term selection is then used to combine word term and word class information in NLCR. A joint latent semantic indexing natural language understanding algorithm is derived and studied in NLCR tasks. Compared with the word-term-based approach, an average performance gain of 10.7% to 14.5% is observed across various training and testing conditions.
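The IG-based selection over a joint vocabulary of word terms and word classes can be illustrated with a minimal sketch. The abstract does not give the paper's exact formulation, so this assumes the standard definition IG(t) = H(C) - P(t) H(C|t) - P(not t) H(C|not t) over a binary "term present" feature. The word-to-class map `WORD_TO_CLASS`, the toy corpus `TRAIN`, and all function names are hypothetical; in the paper the classes come from an automatic clustering algorithm, not a hand-written table.

```python
import math
from collections import Counter

def entropy(counts):
    """Shannon entropy in bits of a distribution given as raw counts."""
    counts = list(counts)
    total = sum(counts)
    if total == 0:
        return 0.0
    return -sum((c / total) * math.log2(c / total) for c in counts if c > 0)

def information_gain(term, utterances):
    """IG of the binary feature 'term is present' with respect to the route.

    utterances: list of (set_of_terms, route) pairs.
    """
    n = len(utterances)
    if n == 0:
        return 0.0
    all_routes = Counter(route for _, route in utterances)
    with_term = Counter(route for terms, route in utterances if term in terms)
    without_term = all_routes - with_term  # route counts where the term is absent
    n_with = sum(with_term.values())
    conditional = (n_with / n) * entropy(with_term.values()) \
        + ((n - n_with) / n) * entropy(without_term.values())
    return entropy(all_routes.values()) - conditional

# Hypothetical word classes (assumption: stands in for automatic clustering).
WORD_TO_CLASS = {"bill": "C_PAY", "invoice": "C_PAY",
                 "outage": "C_TECH", "router": "C_TECH"}

def joint_terms(text):
    """Return the utterance's word terms plus the classes of those words."""
    words = text.lower().split()
    classes = {WORD_TO_CLASS[w] for w in words if w in WORD_TO_CLASS}
    return set(words) | classes

# Toy hand-routed corpus (assumption, for illustration only).
TRAIN = [
    ("pay my bill", "billing"),
    ("question about invoice", "billing"),
    ("report an outage", "support"),
    ("my router is broken", "support"),
]

def select_terms(train, k):
    """Rank word and class terms together by IG and keep the top k."""
    data = [(joint_terms(text), route) for text, route in train]
    vocab = set().union(*(terms for terms, _ in data))
    ranked = sorted(vocab, key=lambda t: (-information_gain(t, data), t))
    return ranked[:k]

print(select_terms(TRAIN, 2))
```

On this toy corpus the two class terms rank highest, because each class pools evidence from several sparse words that individually appear only once. That is the intuition behind mixing word and class terms, though the sketch says nothing about the gains reported in the paper.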
Similar resources
Discriminative training in natural language call routing
In this paper, we show how discriminative training can be used to improve classifiers used in natural language processing, using as an example the task of natural language call routing. In natural language call routing, callers are routed to desired departments based on natural spoken responses to an open-ended “How may I direct your call?” prompt. With vector-based natural language call routin...
Vector-based Natural Language Call Routing
This paper describes a domain-independent, automatically trained natural language call router for directing incoming calls in a call center. Our call router directs customer calls based on their response to an open-ended “How may I direct your call?” prompt. Routing behavior is trained from a corpus of transcribed and hand-routed calls and then carried out using vector-based information retrieva...
Cross-Word Arabic Pronunciation Variation Modeling Using Part of Speech Tagging
Speech recognition is often used as the front-end for many natural language processing (NLP) applications. Some of these applications include machine translation, information retrieval and extraction, voice dialing, call routing, speech synthesis/recognition, data entry, dictation, control, etc. Thus, much research work has been done to improve the speech recognition and the related NLP applica...
Language model adaptation using minimum discrimination information
In this paper, adaptation of language models using the minimum discrimination information criterion is presented. Language model probabilities are adapted based on unigram, bigram and trigram features using a modified version of the generalized iterative scaling algorithm. Furthermore, a language model compression algorithm based on conditional relative entropy is discussed. It removes probabil...
A data visualization and analysis method for natural language call routing system design
We describe a data visualization tool that allows a natural language call routing system designer to browse the data from high level routing target classes down to individual sentences. For each target class, automatic clustering creates groups that cluster similar requests. Relabeling data is much more efficient because a cluster of many sentences, instead of individual sentences, can be relab...